Overview

Dataset statistics

Number of variables: 9
Number of observations: 768
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 54.1 KiB
Average record size in memory: 72.2 B

Variable types

Numeric: 8
Categorical: 1

Alerts

Age is highly overall correlated with Pregnancies (High correlation)
Insulin is highly overall correlated with SkinThickness (High correlation)
Pregnancies is highly overall correlated with Age (High correlation)
SkinThickness is highly overall correlated with Insulin (High correlation)
Pregnancies has 111 (14.5%) zeros (Zeros)
BloodPressure has 35 (4.6%) zeros (Zeros)
SkinThickness has 227 (29.6%) zeros (Zeros)
Insulin has 374 (48.7%) zeros (Zeros)
BMI has 11 (1.4%) zeros (Zeros)
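
The zero percentages in the alerts are each count divided by the 768 observations. A minimal sketch of that arithmetic follows; the names `ZERO_COUNTS` and `zero_share` are illustrative and not part of the report.

```python
# Zero counts copied from the alerts above; 768 is the number of observations.
N_OBSERVATIONS = 768
ZERO_COUNTS = {
    "Pregnancies": 111,
    "BloodPressure": 35,
    "SkinThickness": 227,
    "Insulin": 374,
    "BMI": 11,
}


def zero_share(count: int, total: int = N_OBSERVATIONS) -> float:
    """Return the percentage of rows equal to zero, rounded to one decimal."""
    return round(100 * count / total, 1)


for column, count in ZERO_COUNTS.items():
    print(f"{column}: {count} zeros ({zero_share(count)}%)")
```

A zero is a plausible value for Pregnancies, whereas a zero in BloodPressure, SkinThickness, Insulin, or BMI may stand for an unrecorded measurement. That reading is an assumption; the report itself only counts the zeros.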

Reproduction

Analysis started: 2025-03-13 08:20:20.678067
Analysis finished: 2025-03-13 08:20:28.017696
Duration: 7.34 seconds
Software version: ydata-profiling v4.12.2
Download configuration: config.json
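
The reported duration is the difference between the two timestamps. A short sketch using only the Python standard library, with variable names chosen for illustration:

```python
from datetime import datetime

# Timestamps copied from the Reproduction section.
started = datetime.fromisoformat("2025-03-13 08:20:20.678067")
finished = datetime.fromisoformat("2025-03-13 08:20:28.017696")

# Subtracting two datetimes yields a timedelta; convert it to seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```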

Variables

Pregnancies
Real number (ℝ)

High correlation  Zeros 

Distinct: 17
Distinct (%): 2.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.8450521
Minimum: 0
Maximum: 17
Zeros: 111
Zeros (%): 14.5%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 1
Median: 3
Q3: 6
95th percentile: 10
Maximum: 17
Range: 17
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 3.3695781
Coefficient of variation (CV): 0.87634133
Kurtosis: 0.15921978
Mean: 3.8450521
Median Absolute Deviation (MAD): 2
Skewness: 0.90167398
Sum: 2953
Variance: 11.354056
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=17)
Common values
Value               Count  Frequency (%)
1                   135    17.6%
0                   111    14.5%
2                   103    13.4%
3                   75     9.8%
4                   68     8.9%
5                   57     7.4%
6                   50     6.5%
7                   45     5.9%
8                   38     4.9%
9                   28     3.6%
Other values (7)    58     7.6%

Minimum 10 values
Value               Count  Frequency (%)
0                   111    14.5%
1                   135    17.6%
2                   103    13.4%
3                   75     9.8%
4                   68     8.9%
5                   57     7.4%
6                   50     6.5%
7                   45     5.9%
8                   38     4.9%
9                   28     3.6%

Maximum 10 values
Value               Count  Frequency (%)
17                  1      0.1%
15                  1      0.1%
14                  2      0.3%
13                  10     1.3%
12                  9      1.2%
11                  11     1.4%
10                  24     3.1%
9                   28     3.6%
8                   38     4.9%
7                   45     5.9%
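
Several of the Pregnancies figures above are derived from one another: the variance is the squared standard deviation, the CV is the standard deviation divided by the mean, the mean is the sum divided by the 768 observations, and the range and IQR are differences of quantiles. The sketch below checks these identities against the rounded values printed in the report; the dictionary and function names are illustrative, and the tolerance is an assumption chosen to absorb the report's rounding.

```python
import math

# Values copied from the Pregnancies section of the report.
PREGNANCIES = {
    "n": 768, "sum": 2953, "mean": 3.8450521,
    "std": 3.3695781, "variance": 11.354056, "cv": 0.87634133,
    "min": 0, "max": 17, "range": 17,
    "q1": 1, "q3": 6, "iqr": 5,
}


def check_identities(s: dict, rel_tol: float = 1e-6) -> dict:
    """Return a mapping from each identity to whether the reported values satisfy it."""
    return {
        "mean = sum / n": math.isclose(s["sum"] / s["n"], s["mean"], rel_tol=rel_tol),
        "variance = std^2": math.isclose(s["std"] ** 2, s["variance"], rel_tol=rel_tol),
        "cv = std / mean": math.isclose(s["std"] / s["mean"], s["cv"], rel_tol=rel_tol),
        "range = max - min": s["max"] - s["min"] == s["range"],
        "iqr = q3 - q1": s["q3"] - s["q1"] == s["iqr"],
    }


for identity, holds in check_identities(PREGNANCIES).items():
    print(f"{identity}: {holds}")
```

The same checks can be applied to any other numeric variable in the report by swapping in its values.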

Glucose
Real number (ℝ)

Distinct: 136
Distinct (%): 17.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 120.89453
Minimum: 0
Maximum: 199
Zeros: 5
Zeros (%): 0.7%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 79
Q1: 99
Median: 117
Q3: 140.25
95th percentile: 181
Maximum: 199
Range: 199
Interquartile range (IQR): 41.25

Descriptive statistics

Standard deviation: 31.972618
Coefficient of variation (CV): 0.26446703
Kurtosis: 0.64077982
Mean: 120.89453
Median Absolute Deviation (MAD): 20
Skewness: 0.1737535
Sum: 92847
Variance: 1022.2483
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
99                  17     2.2%
100                 17     2.2%
111                 14     1.8%
125                 14     1.8%
129                 14     1.8%
106                 14     1.8%
102                 13     1.7%
105                 13     1.7%
112                 13     1.7%
95                  13     1.7%
Other values (126)  626    81.5%

Minimum 10 values
Value               Count  Frequency (%)
0                   5      0.7%
44                  1      0.1%
56                  1      0.1%
57                  2      0.3%
61                  1      0.1%
62                  1      0.1%
65                  1      0.1%
67                  1      0.1%
68                  3      0.4%
71                  4      0.5%

Maximum 10 values
Value               Count  Frequency (%)
199                 1      0.1%
198                 1      0.1%
197                 4      0.5%
196                 3      0.4%
195                 2      0.3%
194                 3      0.4%
193                 2      0.3%
191                 1      0.1%
190                 1      0.1%
189                 4      0.5%

BloodPressure
Real number (ℝ)

Zeros 

Distinct: 47
Distinct (%): 6.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 69.105469
Minimum: 0
Maximum: 122
Zeros: 35
Zeros (%): 4.6%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 38.7
Q1: 62
Median: 72
Q3: 80
95th percentile: 90
Maximum: 122
Range: 122
Interquartile range (IQR): 18

Descriptive statistics

Standard deviation: 19.355807
Coefficient of variation (CV): 0.28009082
Kurtosis: 5.1801566
Mean: 69.105469
Median Absolute Deviation (MAD): 8
Skewness: -1.843608
Sum: 53073
Variance: 374.64727
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=47)
Common values
Value               Count  Frequency (%)
70                  57     7.4%
74                  52     6.8%
78                  45     5.9%
68                  45     5.9%
72                  44     5.7%
64                  43     5.6%
80                  40     5.2%
76                  39     5.1%
60                  37     4.8%
0                   35     4.6%
Other values (37)   331    43.1%

Minimum 10 values
Value               Count  Frequency (%)
0                   35     4.6%
24                  1      0.1%
30                  2      0.3%
38                  1      0.1%
40                  1      0.1%
44                  4      0.5%
46                  2      0.3%
48                  5      0.7%
50                  13     1.7%
52                  11     1.4%

Maximum 10 values
Value               Count  Frequency (%)
122                 1      0.1%
114                 1      0.1%
110                 3      0.4%
108                 2      0.3%
106                 3      0.4%
104                 2      0.3%
102                 1      0.1%
100                 3      0.4%
98                  3      0.4%
96                  4      0.5%

SkinThickness
Real number (ℝ)

High correlation  Zeros 

Distinct: 51
Distinct (%): 6.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20.536458
Minimum: 0
Maximum: 99
Zeros: 227
Zeros (%): 29.6%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 23
Q3: 32
95th percentile: 44
Maximum: 99
Range: 99
Interquartile range (IQR): 32

Descriptive statistics

Standard deviation: 15.952218
Coefficient of variation (CV): 0.77677549
Kurtosis: -0.52007187
Mean: 20.536458
Median Absolute Deviation (MAD): 12
Skewness: 0.1093725
Sum: 15772
Variance: 254.47325
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
0                   227    29.6%
32                  31     4.0%
30                  27     3.5%
27                  23     3.0%
23                  22     2.9%
18                  20     2.6%
33                  20     2.6%
28                  20     2.6%
31                  19     2.5%
39                  18     2.3%
Other values (41)   341    44.4%

Minimum 10 values
Value               Count  Frequency (%)
0                   227    29.6%
7                   2      0.3%
8                   2      0.3%
10                  5      0.7%
11                  6      0.8%
12                  7      0.9%
13                  11     1.4%
14                  6      0.8%
15                  14     1.8%
16                  6      0.8%

Maximum 10 values
Value               Count  Frequency (%)
99                  1      0.1%
63                  1      0.1%
60                  1      0.1%
56                  1      0.1%
54                  2      0.3%
52                  2      0.3%
51                  1      0.1%
50                  3      0.4%
49                  3      0.4%
48                  4      0.5%

Insulin
Real number (ℝ)

High correlation  Zeros 

Distinct: 187
Distinct (%): 24.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 79.778357
Minimum: 0
Maximum: 846
Zeros: 374
Zeros (%): 48.7%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 30.5
Q3: 127.25
95th percentile: 293
Maximum: 846
Range: 846
Interquartile range (IQR): 127.25

Descriptive statistics

Standard deviation: 115.24252
Coefficient of variation (CV): 1.4445336
Kurtosis: 7.2164615
Mean: 79.778357
Median Absolute Deviation (MAD): 30.5
Skewness: 2.2728864
Sum: 61269.778
Variance: 13280.837
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
0                   374    48.7%
105                 11     1.4%
140                 9      1.2%
130                 9      1.2%
120                 8      1.0%
100                 7      0.9%
94                  7      0.9%
180                 7      0.9%
115                 6      0.8%
135                 6      0.8%
Other values (177)  324    42.2%

Minimum 10 values
Value               Count  Frequency (%)
0                   374    48.7%
14                  1      0.1%
15                  1      0.1%
16                  1      0.1%
18                  2      0.3%
22                  1      0.1%
23                  2      0.3%
25                  1      0.1%
29                  1      0.1%
32                  1      0.1%

Maximum 10 values
Value               Count  Frequency (%)
846                 1      0.1%
744                 1      0.1%
680                 1      0.1%
600                 1      0.1%
579                 1      0.1%
545                 1      0.1%
543                 1      0.1%
540                 1      0.1%
510                 1      0.1%
495                 2      0.3%

BMI
Real number (ℝ)

Zeros 

Distinct: 248
Distinct (%): 32.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 31.992578
Minimum: 0
Maximum: 67.1
Zeros: 11
Zeros (%): 1.4%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0
5th percentile: 21.8
Q1: 27.3
Median: 32
Q3: 36.6
95th percentile: 44.395
Maximum: 67.1
Range: 67.1
Interquartile range (IQR): 9.3

Descriptive statistics

Standard deviation: 7.8841603
Coefficient of variation (CV): 0.24643717
Kurtosis: 3.2904429
Mean: 31.992578
Median Absolute Deviation (MAD): 4.6
Skewness: -0.42898159
Sum: 24570.3
Variance: 62.159984
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
32                  13     1.7%
31.6                12     1.6%
31.2                12     1.6%
0                   11     1.4%
32.4                10     1.3%
33.3                10     1.3%
32.9                9      1.2%
30.1                9      1.2%
30.8                9      1.2%
32.8                9      1.2%
Other values (238)  664    86.5%

Minimum 10 values
Value               Count  Frequency (%)
0                   11     1.4%
18.2                3      0.4%
18.4                1      0.1%
19.1                1      0.1%
19.3                1      0.1%
19.4                1      0.1%
19.5                2      0.3%
19.6                3      0.4%
19.9                1      0.1%
20                  1      0.1%

Maximum 10 values
Value               Count  Frequency (%)
67.1                1      0.1%
59.4                1      0.1%
57.3                1      0.1%
55                  1      0.1%
53.2                1      0.1%
52.9                1      0.1%
52.3                2      0.3%
50                  1      0.1%
49.7                1      0.1%
49.6                1      0.1%

DiabetesPedigreeFunction
Real number (ℝ)

Distinct: 518
Distinct (%): 67.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.47172621
Minimum: 0.078
Maximum: 2.42
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 0.078
5th percentile: 0.14035
Q1: 0.24375
Median: 0.3725
Q3: 0.62625
95th percentile: 1.13285
Maximum: 2.42
Range: 2.342
Interquartile range (IQR): 0.3825

Descriptive statistics

Standard deviation: 0.33130248
Coefficient of variation (CV): 0.70231944
Kurtosis: 5.6011474
Mean: 0.47172621
Median Absolute Deviation (MAD): 0.1675
Skewness: 1.9216729
Sum: 362.28573
Variance: 0.10976134
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
0.258               6      0.8%
0.254               6      0.8%
0.207               5      0.7%
0.268               5      0.7%
0.261               5      0.7%
0.238               5      0.7%
0.259               5      0.7%
0.304               4      0.5%
0.263               4      0.5%
0.27                4      0.5%
Other values (508)  719    93.6%

Minimum 10 values
Value               Count  Frequency (%)
0.078               1      0.1%
0.084               1      0.1%
0.085               2      0.3%
0.088               2      0.3%
0.089               1      0.1%
0.092               1      0.1%
0.096               1      0.1%
0.1                 1      0.1%
0.101               1      0.1%
0.102               1      0.1%

Maximum 10 values
Value               Count  Frequency (%)
2.42                1      0.1%
2.329               1      0.1%
2.288               1      0.1%
2.137               1      0.1%
1.893               1      0.1%
1.781               1      0.1%
1.731               1      0.1%
1.699               1      0.1%
1.698               1      0.1%
1.6                 1      0.1%

Age
Real number (ℝ)

High correlation 

Distinct: 53
Distinct (%): 6.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 33.246415
Minimum: 21
Maximum: 81
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 6.1 KiB

Quantile statistics

Minimum: 21
5th percentile: 21
Q1: 24
Median: 29
Q3: 41
95th percentile: 58
Maximum: 81
Range: 60
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 11.759233
Coefficient of variation (CV): 0.35369929
Kurtosis: 0.64224297
Mean: 33.246415
Median Absolute Deviation (MAD): 7
Skewness: 1.1285314
Sum: 25533.246
Variance: 138.27957
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)
Common values
Value               Count  Frequency (%)
22                  72     9.4%
21                  63     8.2%
25                  48     6.2%
24                  46     6.0%
23                  38     4.9%
28                  35     4.6%
26                  33     4.3%
27                  32     4.2%
29                  28     3.6%
31                  24     3.1%
Other values (43)   349    45.4%

Minimum 10 values
Value               Count  Frequency (%)
21                  63     8.2%
22                  72     9.4%
23                  38     4.9%
24                  46     6.0%
25                  48     6.2%
26                  33     4.3%
27                  32     4.2%
28                  35     4.6%
29                  28     3.6%
30                  21     2.7%

Maximum 10 values
Value               Count  Frequency (%)
81                  1      0.1%
72                  1      0.1%
70                  1      0.1%
69                  2      0.3%
68                  1      0.1%
67                  3      0.4%
66                  4      0.5%
65                  3      0.4%
64                  1      0.1%
63                  4      0.5%

Outcome
Categorical

Distinct: 3
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 45.2 KiB

0.0: 499
1.0: 266
0.3477124183006536: 3

Length

Max length: 18
Median length: 3
Mean length: 3.0585938
Min length: 3

Characters and Unicode

Total characters: 2349
Distinct characters: 10
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0.3477124183006536
2nd row: 0.0
3rd row: 1.0
4th row: 0.0
5th row: 1.0

Common Values

Value               Count  Frequency (%)
0.0                 499    65.0%
1.0                 266    34.6%
0.3477124183006536  3      0.4%
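
The third Outcome value is not a class label. One reading consistent with the counts, offered here as an assumption since the report does not document any preprocessing, is that three missing labels were filled with the mean of the 765 labelled rows. The sketch below checks that arithmetic with values copied from the table above; the variable names are illustrative.

```python
import math

# Counts and the unexpected value, copied from the Common Values table.
count_zero = 499
count_one = 266
odd_value = 0.3477124183006536
odd_count = 3

# Mean of the binary labels over the rows that carry a 0.0 or 1.0.
labelled_rows = count_zero + count_one
mean_of_labels = count_one / labelled_rows

print(f"labelled rows: {labelled_rows}")
print(f"mean of labels: {mean_of_labels}")
print(f"matches the third value: {math.isclose(mean_of_labels, odd_value, rel_tol=1e-12)}")
```

If that reading holds, it would also explain why Outcome is typed as Categorical with three distinct values rather than two.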

Length

Histogram of lengths of the category

Most occurring characters

Value               Count  Frequency (%)
0                   1273   54.2%
.                   768    32.7%
1                   272    11.6%
3                   9      0.4%
4                   6      0.3%
7                   6      0.3%
6                   6      0.3%
2                   3      0.1%
8                   3      0.1%
5                   3      0.1%

Most occurring categories

Value               Count  Frequency (%)
(unknown)           2349   100.0%

Most frequent character per category

(unknown): the same ten characters, counts, and frequencies as listed under Most occurring characters.

Most occurring scripts

Value               Count  Frequency (%)
(unknown)           2349   100.0%

Most frequent character per script

(unknown): the same ten characters, counts, and frequencies as listed under Most occurring characters.

Most occurring blocks

Value               Count  Frequency (%)
(unknown)           2349   100.0%

Most frequent character per block

(unknown): the same ten characters, counts, and frequencies as listed under Most occurring characters.

Interactions

[64 interaction plots; images not included in this text export.]

Correlations

                          Age     BMI     BloodPressure  DiabetesPedigreeFunction  Glucose  Insulin  Outcome  Pregnancies  SkinThickness
Age                        1.000   0.132   0.350          0.041                     0.285   -0.115    0.215    0.608       -0.067
BMI                        0.132   1.000   0.293          0.142                     0.231    0.193    0.216    0.000        0.444
BloodPressure              0.350   0.293   1.000          0.030                     0.235   -0.007    0.093    0.185        0.126
DiabetesPedigreeFunction   0.041   0.142   0.030          1.000                     0.091    0.221    0.114   -0.044        0.181
Glucose                    0.285   0.231   0.235          0.091                     1.000    0.213    0.343    0.131        0.060
Insulin                   -0.115   0.193  -0.007          0.221                     0.213    1.000    0.095   -0.127        0.541
Outcome                    0.215   0.216   0.093          0.114                     0.343    0.095    1.000    0.163        0.140
Pregnancies                0.608   0.000   0.185         -0.044                     0.131   -0.127    0.163    1.000       -0.085
SkinThickness             -0.067   0.444   0.126          0.181                     0.060    0.541    0.140   -0.085        1.000
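
The High correlation alerts can be related to this matrix by listing the off-diagonal pairs whose absolute coefficient reaches a cut-off. The sketch below uses a cut-off of 0.5, which is an assumption rather than a value stated in this report; `high_correlation_pairs` is an illustrative helper, not a ydata-profiling function.

```python
from itertools import combinations

# Column order and coefficients copied from the correlation matrix above.
COLUMNS = [
    "Age", "BMI", "BloodPressure", "DiabetesPedigreeFunction", "Glucose",
    "Insulin", "Outcome", "Pregnancies", "SkinThickness",
]
MATRIX = [
    [1.000, 0.132, 0.350, 0.041, 0.285, -0.115, 0.215, 0.608, -0.067],
    [0.132, 1.000, 0.293, 0.142, 0.231, 0.193, 0.216, 0.000, 0.444],
    [0.350, 0.293, 1.000, 0.030, 0.235, -0.007, 0.093, 0.185, 0.126],
    [0.041, 0.142, 0.030, 1.000, 0.091, 0.221, 0.114, -0.044, 0.181],
    [0.285, 0.231, 0.235, 0.091, 1.000, 0.213, 0.343, 0.131, 0.060],
    [-0.115, 0.193, -0.007, 0.221, 0.213, 1.000, 0.095, -0.127, 0.541],
    [0.215, 0.216, 0.093, 0.114, 0.343, 0.095, 1.000, 0.163, 0.140],
    [0.608, 0.000, 0.185, -0.044, 0.131, -0.127, 0.163, 1.000, -0.085],
    [-0.067, 0.444, 0.126, 0.181, 0.060, 0.541, 0.140, -0.085, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def high_correlation_pairs(columns, matrix, threshold=THRESHOLD):
    """Return (column_a, column_b, coefficient) for each pair at or above the threshold."""
    pairs = []
    # combinations() visits each unordered pair once and skips the diagonal.
    for i, j in combinations(range(len(columns)), 2):
        if abs(matrix[i][j]) >= threshold:
            pairs.append((columns[i], columns[j], matrix[i][j]))
    return pairs


for first, second, value in high_correlation_pairs(COLUMNS, MATRIX):
    print(f"{first} / {second}: {value}")
```

With this cut-off the two pairs found are Age with Pregnancies and Insulin with SkinThickness, the same pairs named in the Alerts section.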

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
     Pregnancies  Glucose  BloodPressure  SkinThickness  Insulin  BMI   DiabetesPedigreeFunction  Age        Outcome
0    6            148      72             35             0.0      33.6  0.627                     50.000000  0.347712
1    1            85       66             29             0.0      26.6  0.351                     31.000000  0.000000
2    8            183      64             0              0.0      23.3  0.672                     32.000000  1.000000
3    1            89       66             23             94.0     28.1  0.167                     21.000000  0.000000
4    0            137      40             35             168.0    43.1  2.288                     33.000000  1.000000
5    5            116      74             0              0.0      25.6  0.201                     30.000000  0.000000
6    3            78       50             32             88.0     31.0  0.248                     26.000000  1.000000
7    10           115      0              0              0.0      35.3  0.134                     33.246415  0.000000
8    2            197      70             45             543.0    30.5  0.158                     53.000000  1.000000
9    8            125      96             0              0.0      0.0   0.232                     54.000000  1.000000

Last rows
     Pregnancies  Glucose  BloodPressure  SkinThickness  Insulin  BMI   DiabetesPedigreeFunction  Age        Outcome
758  1            106      76             0              0.0      37.5  0.197                     26.0       0.0
759  6            190      92             0              0.0      35.5  0.278                     66.0       1.0
760  2            88       58             26             16.0     28.4  0.766                     22.0       0.0
761  9            170      74             31             0.0      44.0  0.403                     43.0       1.0
762  9            89       62             0              0.0      22.5  0.142                     33.0       0.0
763  10           101      76             48             180.0    32.9  0.171                     63.0       0.0
764  2            122      70             27             0.0      36.8  0.340                     27.0       0.0
765  5            121      72             23             112.0    26.2  0.245                     30.0       0.0
766  1            126      60             0              0.0      30.1  0.349                     47.0       1.0
767  1            93       70             31             0.0      30.4  0.315                     23.0       0.0